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(A gentle introduction to noise filtering) 
by M. Tham 

INTRODUCTION 

When measurements are corrupted by random variations, they 
are said to be affected by noise. Since the standard deviation is 
a measure of spread in a data distribution, these random 
variations can be characterised by the standard deviation of the 
measured signal. That is, the larger the standard deviation, the 
noisier is the measurement. The procedure of reducing or 
attenuating the noise components of a measured signal is 
commonly known as filtering. 

There are many different ways to design filters, but the most 
common ones have their roots in simple averaging. 




The purpose of this set of notes is to 



• introduce some of the terminology used in the area of 
signal processing 

• show why signal averaging can reduce the effects of noise 

• introduce some filtering algorithms that are based on 
averaging 

• show that the low-pass filter that is commonly used in 
industry is equivalent to one form of averaging filter 
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(A gentle introduction to noise filtering) 
Averaging Filter 

Simple averaging can be used to reduce the effects of noise. 
Suppose we have n measurements of a variable x. The 
standard deviation of this measured variable can be 
estimated by: 



s = 



where J is the mean or average of the n measurements 
calculated as: 



x = — 



2> 

i-i 



The magnitude of s is clearly dependent on the measurements, 

x i, which in practice is bounded, that is it has lower and upper 
limits. However, s is also dependent on the number of 
measurements made, i.e. the number n. Thus for bounded 

values of x i, it can be deduced that the larger n is, the smaller 
s becomes. In other words, given a noisy but bounded 
measurement sequence, we can take a large number of 
readings of the variable and use its average to give a better 
estimate of its true value (provided there is no systematic 
error or bias in the measurements). This is actually standard 
procedure in experimental work, where a number of readings 
are taken at a sampling instant and the average of these 
readings used as the measurement. 



Although it is simple to calculate averages using the f ormula 
above , for online applications, it is inefficient both in terms of 
storage and computational requirements. This is because we 
need to: 



• store n data values 

• perform n additions and 1 division 



Given that computer storage and high speed microprocessors 
are very cheap nowadays, this may not seem to be a problem. 
However, the load on a process computer can become quite 
significant when you consider the fact that hundreds, and 
perhaps even thousands, of measurements are made on a 
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typical process plant. We therefore need to look at more 
efficient ways of calculating averages. 
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(A gentle introduction to noise filtering) 

Moving Average Filter 

A slight improvement in computational efficiency can be 
achieved if we perform the calculation of the mean in a 
recursive fashion. A recursive solution is one which depends on 
a previously calculated value. To illustrate this, consider the 
following development: 

Suppose that at any instant k, the average of the latest n 
samples of a data sequence, x * , is given by: 



1 ^ 



'i-k-x+l 



Similarly, at the previous time instant, k-1, the average of the 
latest n samples is: 



I tr-l 



Therefore, =■ _ ? 



[ % X k-X 1 



which on rearrangement gives: =?„.•= . _ * ] 

A k A k-l [ A k A k-« J 



This is known as a moving average because the average at 
each k'th instant is based on the most recent set of n values. In 
other words, at any instant, a moving window of n values are 
used to calculate the average of the data sequence (see Figure 
1). 
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Figure 1 Moving Window of n Data Points 



When used as a filter, the value of % is taken as the filtered 

value of x k. The expression is a recursive one, because the 

value of *k is calculated using its previous value, as 
reference. 

Compared to simple averaging , it can be seen that we need only 
perform 1 division, 1 addition and 1 subtraction operation. This 
is always the case, regardless of the number of data points (n) 
we consider. However, calculating the current filtered value 

requires the use of , i.e. the measurement n time-steps in 
the past. 

This means that: 

1. the filtering cannot be initiated reliably until n 
measurements have been made, and 

2. we need to store the value of x tr* which, depending on 
the way the algorithm is coded, may require up to n 
storage locations. 

Additionally, the technique places equal emphasis on all data 
points. Thus a value in the past will have the same influence as 
a more current measurement when calculating the filtered 
signal. This may be a desirable feature when the mean value of 
the measurement is almost constant, but not when the signal 
has a trend as most process measurements do. 

These problems can however, be reduced by generating the 
filtered value in a slightly different manner. 
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The moving average filter regards each data point in the data window to 
be equally important when calculating the average (filtered) value. In 
dynamic systems, however, the most current values tend to reflect better 
the state of the process. A filter that places more emphasis on the most 
recent data would therefore be more useful. Such a filter can be designed 
by following the procedure used in developing the moving average filter. 
As before, the starting point is the mean expressed as: 

** - - 2-* 

But in this case, consider also the mean with one additional point 



_L V 1 



*JU1 + 



i-fc-a+l 



Since ^jx t = nx k , therefore, 



By shifting the time index back one time-step, we obtain the 
corresponding expression for % as: 



n 



To simplify the notation, let o> = , which implies that (1 



-H-y 



We can write the filter as: 



ax 



fc-i 



+ (l-a)j 



This expression is known as the Exponentially Weighted Moving Average 
Filter. When used as a filter, the value of h is again taken as the filtered 
value of x k. Notice that now, calculation of % does not require storage of 
past values of x, and that only 1 addition, 1 subtraction, and 2 
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multiplication operations are required. 

The value of the filter constant, a , dictates the degree of filtering, i.e. 
how strong the filtering action will be. Since n> , this means that 
< a. < 1 . When a large number of points are being considered, a, 1 , 

and x k This means that the degree of filtering is so great that the 

measurement does not play a part in the calculation of the average! On 

the other extreme, if » -> , then % ~* x t which means that virtually no 
filtering is being performed. 

The Exponentially Weighted Moving Average filter places more importance 
to more recent data by discounting older data in an exponential manner 
(hence the name). This characteristic can be illustrated simply by 
describing the current average value in terms of past data. For example, 
since 

= + (l " ot)x k ( 

then 

Therefore, 

= + " <*)x k = <x[otx^ 2 + (1 - a)x h _ 1 ] + (1 - a)x k 

i.e. ^ = « 3 ^-a +a O-a)jfjw + (l-a)^ 

But = «*%3 + t 1 _ , 
Therefore, 

X k = a J [ai w + (1 - a)x^ 2 ] + a(l - + (l - a)x k 

= a 3 T w +a 2 (\-a)x^ 2 + a{l- a)x^ + (1 - <x)x k 

If we keep on expanding X terms on the right hand side, we will see that 

the contribution of older values of x i are weighted by increasing powers of 

ot. Since a is less than 1, the contribution of older values of x i becomes 

progressively smaller. The weighting on x i may be represented 
graphically by the following plot: 
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Figure 2. Exponential weighting effect 

What this means is that in calculating the filtered value, more emphasis is 
given to more recent measurements. 

The Exponentially Weighted Moving Average filter is arguably the most 
commonly used noise reduction algorithm in the process industries. 
However, it is known commonly by a another name; one that has its 
roots in electrical circuitry that are used to produce smooth electrical 
signals. 
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(A gentle introduction to noise filtering) 

The 1st-order Low-pass Filter 

The Exponential! y Weighted Movi ng Average filter is identical to 
the discrete first-order low-pass filter - a commonly available 
feature in most process computers. The objective of this section 
is to show their equivalence. 

Consider the Laplace transfer function of a first-order low-pass 
filter, with time constant 7 f . 



1 



l+T f S 



which relates the filtered signal x(_s) to the measurement x ( s ) . 
This has the following time domain equivalent: 



dx(t) 



+ x(t) = x(t) 



( Note : this differential equation can also be used to describe the input and the 
output behaviour of an electrical RC-circuit) 

Now, the differential equation can be discretised using the 
approximation: 



dx(£) 
dt 



T 



where T s is the interval between each measurement, i.e. the 

sampling interval. Thus the differential equation representing 
the first-order low pass filter is converted to: 



Simplification and re-arrangement gives: 



z 



By letting 
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a = 



(1-0!) 



z 



then 



x k = our w + (1- a)** 



which is identical to the Exponentially Weighted Moving Average 
f il te r. 
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(A gentle introduction to noise filtering) 
Choice of Filter Constants 

The degree of noise attenuation using the E xpon entially Weigh ted 
Moving Average and the first-order low-pass filte r is determined 
by the parameter ot . 

With the Exponentially Weighted Moving Average algorithm, a is 
determined by, n, the number of data points being considered in 
the calculation, i.e. 



n 



a 



8+1 



In the first-order low-pass filter, is determined by T , the filter 
time-constant, i.e. 



a = 



Since a = 



n 



8+1 



and a = 



, it is obvious that n and t / 



must be related. After some algebraic manipulation, the exact 
relationship can be worked out to be 



Based on the previous discussion on the effects of a on the 
degree of filtering, it follows that the larger the filter time- 
constant, the higher the degree of filtering. This is illustrated in 
the diagrams below. 



Figure 3 shows the results of sending a signal of unit step (dashed 
line) to a low-pass filter with different values of a, while Figure 4 
shows the results when a noisy step input is sent to the same set 
of filters. 
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Figure 4 Outputs of first-order filters to a noisy step input 

These plots reveal that noise attenuation is good when <x is set to 
a high value, but this is achieved at the expense of large lags on 
the filtered signal. If the filtered signals are used for feedback 
control, then excessive lags can severely degrade the stability 
margins of the closed loop system. On the other hand, the filter 
with a value of a= 0.3 hardly removed the noise. From 
experience, a value of a= 0.7 seems to give good performances, 
and this is equivalent to choosing n « 2 or ^ * 27^ . 
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The fact is that a compromise has to be made when selecting a 
value for o> and is the principal noise filter design objective; 
achieving sufficient noise attenuation with minimum processed 
signal lags. That however, is another story. 
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(A gentle introduction to noise filtering) 
Frequency Characteristics 



Low-pass filters allow the low frequency components of an input 
signal to pass through while attenuating (reducing) high 
frequency components, hence the term low-pass. There are 
other filters, such as band-pass and high-pass filters, where 
the classification is based on the frequency ranges that the filter 
allows to pass through. 

Measurement noise fall into the high frequency range of the 
signal spectrum, while the underlying process signal usually lies 
towards the low frequency end. Thus filters that are used to 
remove noise from measurements are of the low-pass types. 
The exponen tia lly weighted moving average filter (or 
equivalently the lst-order low-pass filter ) are but one of many 
possible types of low-pass filters. 



A signal can be decomposed into components of different 
frequencies. Thus, one method of examining the capabilities of 
filters is to look at what happens to inputs of different 
frequencies when they are passed through the filter. There are 
two features of the output signals that we can investigate: 



L what are the differences between the magnitudes of the 
input and the output signals (given by the amplitude 
ratio) and 

2. whether there is a time lag between input and output 
signals (given by the phase-shift) 



Plots of amplitude-ratio and phase-shift against frequency gives 
the frequency response of the system. For simplicity, in this 
set of introductory notes, we will consider only the amplitude 
ratio characteristics of low-pass filters. 



The amplitude ratio can be regarded as a frequency dependent 
gain of the filter. The job of the low-pass filter is to filter out 
high frequency components, therefore its amplitude ratio should 
be low at high frequencies. At low frequencies, the low-pass 
filter should allow the input signal to pass through undistorted, 
and so its amplitude ratio at the low frequency end of the signal 
spectrum should be unity. The red lines in the figure below 
show the frequency dependent amplitude ratio plots of 3 first- 
order low-pass filters, with time-constants 10, 20 and 30. 
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Figure 5. Amplitude ratio plots of lst-order low-pass 
filters with differen time-constants 

Note that the amplitude ratios in the above figure are expressed 
in terms of decibels (dB). 

To convert a number, x, to dB, simply apply a log and multiply 
the result by 20, that is, aologmfx) ■ Therefore dB is 

equivalent to an amplitude ratio of 1, while a large negative dB 
indicates that the amplitude ratio is very small. 

From Figure 5 , we can see that first-order filters are capable of 
attenuating high frequency components. In particular, the larger 
the time-constant of the filter, the higher the degree of filtering . 
This is easily discerned by examining the amplitude ratios of the 
three filters at any particular frequency (see the three vertical, 
dashed blue lines in Figure 5). 

A measure of the efficiency of a filter is its bandwidth. This is 
defined as the frequency range of a signal that a filter allows to 
pass through with minimal attenuation; signal attenuation is 
considered to be significant when the amplitude ratio is less 
then -3dB, (approximately 0.7 as a ratio). The -3dB line is the 
horizontal magenta coloured line in Fi gure 5 , from which it can 
be seen that the bandwidth of a filter decreases with increasing 
values of filter time-constants. 

Low-pass filters need not be limited to first-order types. Figure 
6 shows the frequency responses of 3 low-pass filters with the 
following Laplace transfer functions: 



1 



First-order low-pass filter 
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Second-order low-pass 
filter 



Third-order low-pass filter 



1 



(1 + 10*) 2 
1 

(1+ 10s) 3 



The second and third-order filters are respectively, implemented 
by placing 2 and 3 first-order filters in series. 




10" 10 10 

Frequency ;rads/tim& i 
Figure 6. Amplitude ratio plots of 1st, 2nd and 3rd-order 

low-pass filters 



We can see from Figure 6 that as the order of the filter 
increases, the slopes of the respective amplitude ratio plots 
becomes steeper. What this indicates is that higher order low- 
pass filters provide higher rates of signal attenuation. From the 
-3dB line (magenta line), it is also clear that the bandwidth of a 
filter decreases with increasing order. Thus a higher degree of 
filtering can be achieved by employing higher order filters. 



Theoretically, we can design a filter such that its amplitude ratio 
beyond some frequency, q c , is a vertical line, while the 

amplitude ratio below this threshold frequency is horizontal at 
dB. A filter with this kind of frequency response characteristics 
is called an ideal low-pass filter, and ro is called the cut-off 

frequency; that is any signal with frequency components 
higher than this will be completely removed by the filter. Thus, 
the bandwidth of the ideal filter is equal to its cut-off frequency. 



As mentioned previously, detrimental jags are introduced into 
the processed signal by low-pass filters. This aspect can also be 
studied by plotting the filter's frequency dependent phase shift 
characteristics. However, the effects are not as graphic as time- 
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response plots, and so phase-shift plots are not covered in this 
set of introductory notes. Those who are interested can find 
further details in standard signal processing texts, or read the 
appropriate section of the notes on Applications of Frequency 
Response (requires the free Acrobat Reader). 
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